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Abstract 



In Descriptive Complexity, there is a vast amount of literature on decision problems, 
and their classes such as P, NP, L and NL. However, research on the descriptive com- 
plexity of optimisation problems has been limited. Optimisation problems corresponding 
to the NP class have been characterised in terms of logic expressions by Papadimitriou 
and Yannakakis, Panconesi and Ranjan, Kolaitis and Thakur, Khanna et al, and by Zi- 
mand. Gradel characterised the polynomial class P of decision problems. In this paper, 
we attempt to characterise the optimisation versions of P via expressions in second order 
logic, many of them using universal Horn formulae with successor relations. The polyno- 
mially bound versions of maximisation (maximization) and minimisation (minimization) 
problems are treated first, and then the maximisation problems in the "not necessarily 
polynomially bound" class. 

1 Introduction 

Though there has been abundant research in Descriptive Complexity since Fagin's 1974 the- 
orem |Fag74| (which captures the class NP as the set of properties that can be represented 
in existential second order logic), the application of this area to approximation complexity 
has been limited. Approximation complexity measures how well an NP-hard optimisation 
problem can be approximated, or how far is the value of a (possible) heuristic solution from 
that of an optimal solution. 

A few attempts to characterise approximation classes in terms of logic are: Papadimitriou 
and Yannakakis in 1991 |PY91j . Panconesi and Ranjan in 1993 |PR93j . Kolaitis and Thakur 
in 1994 and 1995 |KT94[ I!rr95| . and Khanna et al in 1998 |KMSV98| . 

The approximation complexity of a problem P is usually measured by the approximation ratio 
that a heuristic H for P can guarantee, over all instances of P. The approximation ratio 
Rh{I) obtained by H for a given instance I of P is given by 
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Rh{I) 



value obtained hy H on I 




value of an optimal solution for I 



In |KT941 IPR931 IPYQlj , the authors characterise approximation hardness in terms of quan- 
tifier complexity — the number and types of quantifiers that appear at the beginning of a 
second-order formula in prenex normal form (PNF). For a formula in PNF, all quantifiers 
appear at the beginning, followed by a quantifier-free formula. 

1.1 Contributions in this paper 

In this paper, we first present a logical representation of a subclass of P' — P' is the class of 
optimisation problems that can be solved to optimality within polynomial time^. The class 
of decision problems corresponding to P' is P. The particular subclass Q' (of P') that we 
focus on includes only polynomially bound optimisation problems, defined below in Definition 
^ In particular, we 

• provide syntactic characterisations for both maximisation and minimisation problems 
in Q', 

• give examples of characterisations (MAXFLOWp^ for maximisation and SHORTEST 
PATHpB minimisation), 

• show that MAXFLOWpp is complete for the maximisation subclass of Q', 

• present characterisations for maximisation problems in P' (defined in Tabled — prob- 
lems not necessarily polynomially bound), as well as an example for a problem in this 
class (MAXIMUM MATCHING). This is the most significant contribution in this pa- 
per, and is a considerable departure from the treatment in |Zim98j . Whereas that paper 
studied problems in N in general, we study maximisation problems in P'. 

The syntactic characterisation of P is given below in Theorem ^ as shown in Gradel |E. 91j . 

1.2 Notation and Definitions 

All notation is defined in Table as a one-stop reference point. For the same reason, all 
definitions are provided below in this section. 

Definition 1. An optimisation problem Q' is said to be polynomially bound if the value of 
an optimal solution to every instance I of Q' is bound by a polynomial in the size of I. In 
other words, there exists a polynomial p such that 

optQ,{I)<p{\I\), (2) 

for every instance I of Q' . The class of all such problems is Q'. 

^Strictly speaking, in Turing machine terminology, P' is the set of languages where, if an instance / of 
an optimisation problem P G P' is encoded as an input string x in some alphabet E, a deterministic Turing 
machine will compute the optimal solution (which is again a string) within 0(|a::|''') steps, where k is some 
constant and |a;| is the length of the input string. 
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a 


vocabulary 


A 


a structure defined over a (captures an instance of an optimi- 
sation problem) 


V 


a quantifier-free first order formula, and a conjunction of Horn 
clauses at the same time. (Recall that a Horn clause contains 
at most one positive literal.) 


X 


an m— tuple of first order variables 


S 


a sequence of second-order variables (predicate symbols) 
(captures a solution to the optimisation problem) 


P 


computational class of decision problems, decidable in polyno- 
mial time by a deterministic Turing machine 


P' 


class of optimisation problems corresponding to P 
(also called P— optimisation problems) 


Q' 


Q' C P', and Q' only contains polynomially bound optimisation 
problems (see Definition^ 


N 


class of optimisation problems whose decision versions are in 
NP 


N' 


N' C N, and N' only contains polynomially hound optimisation 
problems 


ESO 


Existential Second Order Logic 


PNF 


Prenex Normal Form 



Table 1: Notation 
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Definition 2. First order logic consists of a vocabulary (alias signature) a, and models 
(alias structures) defined on the vocabulary. In its simplest form, a vocabulary consists of 
a set of variables, and a set of relation symbols Rjil < j < J), each of arity rj. A model 
M consists of a universe U whose elements are the values that variables can take — M also 
instantiates each relation symbol Rj G a with tuples from U^^^\ For example, a model G in 
graph theory may have the set of vertices G = {1, 2, • ■ ■ 10} as its universe (assuming that the 
graph has 10 vertices), and a single binary relation E where E{i,j) is true iff{i,j) is an edge 
in the graph G. A model represents an instance of an optimisation problem. 

Definition 3. A IIi first order formula in PNF only has universal (existential) 

quantifiers, quantified over first order variables. 

Definition 4. An existential second-order (ESO) Horn expression is of the form 3SV', 
where ip is a first order formula, and S = {Si, ■ ■ ■ Sp) is a sequence of predicate symbols not 
in the vocabulary of ip. The formula Tp can be written in Hi form as 

ip = VxiV2;2 • • • Vxfc?7 = Vx rj. (3) 

where rj is a conjunction of Horn clauses (rj is, of course, quantifier-free) , and Xi {\ < i < k) 
are first order variables. Each clause in r] contains at most one positive occurrence of any of 
the second order predicates Si (1 < i < p). 

Definition 5. A II2 (S2) formula in prenex normal form (PNF) can be written as 
follows: 

= Vxi • • • Vxa 3yi • • • 3yh rj ((/) = 3yi • • • By,, Vxi • • • Mxa v), (4) 

where rj is quantifier- free, a,b> 1, and the x's and y's are first-order variables. 

The following theorem is due to Gradel |E. 91j — this is the polynomial-time counterpart of 
Fagin's theorem Fag74| which characterised the class NP: 



Theorem 1. For any ESO Horn expression as defined in Definition^ the corresponding 
decision problem is in P. 

The converse is also true — if a problem P is in P, then it can be expressed in ESO Horn 
form — but only if a successor relation is allowed to be included in the vocabulary of the 
first- order formula ip. 



2 Polynomially Bound Optimisation Problems Q' 

Optimisation problems corresponding to P. We assume that for a maximisation (or a 
minimisation) problem Q' in the class Q' (corresponding to the class P of decision problems), 
the following can be computed in polynomial time deterministically: (a) The value of the 
objective function /(A, S) to a solution S of an instance A, and (b) Whether a solution S is 
a feasible solution to an instance A. 

We will study maximisation problems first, and then the minimisation problems. 
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2.1 Polynomially Bound P-Maximisation Problems 

For maximisation problems in N' (see Tabled for a definition of N'), Kolaitis and Thakur 
|KT94j proved the following: 

Theorem 2. A maximisation problem Q € N' z/ and only if there exists a II2 first order 
formula (/>(w, S) with predicate symbols from the vocabulary a (of 4>) and the sequence S, such 
that for every instance A of Q, the optimal solution value is given by 

optqiA) = max | {w : (A, S) ^ H^, S)} |. (5) 



In other words, polynomially bound NP-maximisation problems fall in what is called the 
MAX 112 class. We can show a similar result for the polynomial-time counterpart of N', that 
is, maximisation problems in Q': 

Theorem 3. Let A be a structure (instance) defined over a. The value of an optimal solution 
to an instance A of a maximisation problem Q' can be represented by 

optQi{A) = max |{w : (A, S) ^ Vx r]{w,x, S)}| (6) 
s 

if Q' £ Q' , where x, A, S and rj are defined in TableU[ 

Proof. Let Q and Q' be the decision and optimisation versions respectively. 

We first show that Q' is polynomially bound. For this, the number of tuples w in an optimal 
solution S* should be polynomial in 1^41, the size of the universe of A. It suffices to show 
this for any solution S. Recall from Table ^ that the sequence S of predicates captures a 
(corresponding) solution S to the optimisation problem. 

Suppose w is a 7^— dimensional tuple. For a given universe A of A, the number of possible 
tuples is — this is true for any solution S including the optimal one. Hence Q' is 

polynomially bound. 

To complete the proof, we should show that if Q' £ Q', then the optimal solution value to an 
instance A of Q' can be represented by equation ©. 

Refer to Gradel's theorem (Theorem The decision problem Q can be written as an ESO 
Horn expression BSV', except that now, ip should include a successor relation in its vocabulary, 
in addition to being a Horn first order formula. Problem Q can be posed as: Given an instance 
(a finite structure) A, is there a feasible solution S such that /(A, S) > K, where K is a 
certain integer ? 

(Here / is the value of the objective function to solution S for the optimisation problem. 
Assume that we deal only with problems with integer- valued objective functions.) 

A feasible solution S could consist of several relations Si, S2, ■ " Sp of arities ri,r2, • ■ ■ rp. The 
formula ip should be able to express /(A,S) > K — however, this is insufficient. Given a 
solution S, we know that all feasibility conditions, including /(A, S) > K, can be checked 
in polynomial time deterministically for all problems in NP. What distinguishes P from 
NP is the fact that for optimisation problems corresponding to P, the optimal solution value 
optQi(A) can be computed in polynomial time deterministically, whereas for optimisation 
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problems corresponding to NP, we only know that this value can be computed in polynomial 
time non-deterministically. 

Hence this condition should be modified to g{A) = optsf{A.,S) > K, where g{A) is the 
optimal solution value to instance A over all solutions S. Each of the g{A) number of 
entities can be considered to be a tuple Wi, and thus we need at least K such tuples. These 
tuples, at least K in number, can be defined to form a new relation F (on the universe A of 
A) of arity k. Thus we want \F\, the number of tuples w that satisfy F(w), to be at least K. 

Digression to discuss arity k. As examples, setting k = 2 will suffice for the LONGEST 
PATH problem where the number of arcs in a path is to be maximised, and /c = 1 in a 
MAXSAT problem where the number of satisfying clauses is to be maximised. However, 
this can handle only up to very small values of the objective function. In the LONGEST 
PATH case, we can only count \A\'^ tuples at most (where \A\ is the number of vertices in the 
graph). However, if arc lengths are higher than one, but still polynomially bound in \A\, the 
length of the longest path — though still polynomially bound in |^| — could be well above 
I^P, and this length cannot be handled by an arity of k = 2 — a higher arity is required. 
Hence it would be safest to increase the arity to TZ, since \A\^ is the upper bound on the 
objective function value. A similar argument applies to the weighted MAXSAT problem with 
polynomially bound weights — and to all polynomially bound NP-maximisation problems in 
general. 

Recall from Theorem^that ip is in Hi form, where the quantifier-free part of '(/' is a conjunction 
of Horn clauses, each of which contains at most one positive occurrence of any'^ of the relation 
symbols Si. Hence ijj can be written as Vxi, • • ■\/xmf}, where i) is an expression consisting of 
variables xi, • • ■ Xm, all predicates from S, and the the relation F. That is, 

^ = Vxi • ■■Mxjn f}{xi, • ■ ■ S) = Vx f)(x, F, S), (7) 

where ?} is a conjunction of Horn clauses {t) captures the feasibility of solution S) and x = 
(xi, • • • ,Xm)- Note that r) needs to capture two types of conditions (an example with such 
conditions is provided in the next section): 

(a) Global conditions (those that apply over all w tuples): Such conditions express the fact 
that the solution (S, F) as a whole is a feasible solution to A. One such condition is 
\F\> K mentioned above. And 

(b) Local conditions: The ones that are specific to a given w — if F{w) is true, that is. 

Thus ?7 is a conjunction of these two types of conditions. The global conditions'^ can be written 
as f)i, and the local conditions as Vw F(w) — > ?/2(x, w, S). So Q, the decision problem, can 
be written as 3S 3F Vx fji A [Vw F(w) — > fi2{x,w, S)]. In prenex normal form, 

Q = 3S3F Vw Vx fji A [F(w) — > ??2(x, w, S)] 

= 3S 3FVw Vxf/i AhF(w) Vf)2(x,w,S)] (8) 

^For example, if and S2 are second order predicates, then a Horn clause cannot contain both Si and S2 
as positive literals. 

•^Observe that 771 captures the cardinality condition |_F| > K. To represent this, we can define a first-order 
relation G of arity k over the universe A of A such that \G\ = K and _F(w) is true whenever G(w) is. Then 
we need to represent the fact that |F| > \G\, which can be characterised as Vw G(w) — > F{w). 
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If fji and fi2 are each a conjunction of Horn clauses, then so is the formula in Q. If we let 
?7(x, w, S, F) = fji A [-^F(w) V ?72(x, w, S)], then © can be rewritten in ESO Horn Hi form 
as 

Q = 3S3F Vw Vx [r/(x, w, S, F)] . (9) 

To express the optimal solution value for Q', we maximise over all feasible solutions S — and 
for each solution, count the number of w tuples for which the relation F{w) and ^/'(w,S) 
hold^: 

optQ,(A) =max|{w: (A,S,F) ^ [Vx r}(x, w, S)] A F(w)} [. (10) 

In (|lUjl . Vx f/(x, w, S) represents the feasibility of the given instance A. 

If S and F can be represented by a single sequence of relations T = 
{Si, S2, • ■ ■ Sp, F), the optimal solution value can be expressed as 

opiQ,(A) =max[{w : (A,T) ^ Vx r/(x, w, T)} [ (11) 

where Ty(x, w, T) = i7(x,w, S) A F{w). (fj and r/ are quantifier-free.) Since 77 is Horn, so is 
V- 

Hence the proof. □ 



2.1.1 Example: Polynomially Bound Maximum Flow (Unit Capacities) 

In this section, we will see how the MAXFLOW problem with unit capacities can be expressed 
in ESO, in Hi form. Given a source s and a sink t, and a network G containing directed 
edges, we want to find the maximum flow that can be sent through the network from s to 
t. Essentially we seek the maximum number of edge-disjoint paths from s to t. Call this 
(polynomially bound) problem MAXFLOWps. 

We want to determine the maximum number of vertices w (one dimensional tuples) to which 
there is a flow from s, along the edge {s,w) (if such an edge exists) — this will give us the 
value of the maximum flow from s to t. Every s — t edge-disjoint path can be considered as a 
partial order on the set of vertices. We will use ideas similar to those used in the expression 
for REACHABILITY |Pap94| . 

To represent the partial orders, introduce a second-order ternary predicate P{x, y, w) which 
holds iS X y and there is an edge-disjoint path from s to li; to 2; to y, in the feasible solution 
— the path from s to w, is of course, just a single edge. (The "main arguments" for P are x 
and y — w is just an additional reference.) Thus we seek the maximum number of w^s such 
that P{w, t, w) is true. The following expressions capture the properties of a feasible solution. 

(1) If P{xi, X2,w) holds, then so does G(s, w) — that is, the edge {s,w) is defined in G: 

4>i = yxiix2'iw P{xi,X2-,w) — > G{s,w) 

(12) 

= VxiVa;2Vw -iP(xi, X2, tt;) V G(s, u)). 

''Out of the four possible cases (i) (Vxj)) A F(w), (ii) (Vx?)) A ^F(w), (iii) (-iVx?)) A F{-w), and (iv) 
(-iVx?)) A -i-F(w), cases (ii) and (iv) must be disregarded since F is false. Case (iii) should also be disregarded 
since it violates the feasibility condition Vx?). 
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(2) An edge (i, j) can be a part of only one s — t disjoint path (equivalently, only one w — t 
edge disjoint path): 

4)2 = yiii^wi\/w2 P{i,j,wi) A P{i,j,'W2) A G{i,j) — > {wi = W2) 

(13) 

= \/i\/j\/wi\/w2 -'P{i,j,wi) V 1^2) V -^G{i,j) V {wi = W2)- 

(3) P is non-reflexive: 

4>3 = VyiVy2 -^P{yi,yi,y2)- 

(4) P is transitive: 

4)^ = yUiiU2yU3yW3 P{ui,U2,W3) A P{U2,U3,W3) > P{ui,U3,W3). 

= V'UiVu2V'U3Vl(;3 ^P{ui,U2,W3) A^P{u2,U3,W3)y P{ui,U3,W3) 

(5) And finally, any two adjacent vertices in P should also be adjacent in G: 

4>5 = MziMz2"^Wi P{zi,Z2,Wi) A ^Z3 ^[P{zi,Z3,W4) AP{z3,Z2,Wi)] >G{zi,Z2) 

= V2:1VZ2VZ3VW4 P{zi,Z2,Wi) A^[P{zi,Z3,W4) A P{Z3,Z2,W4)] >G{zi,Z2) 

= V2;1VZ2VZ3VW4 ^P{zi,Z2,W4)y [P{zi,Z3,W4) A P{z3,Z2,W4)]y G{zi,Z2) (16) 
= yZi\/Z2\/Z3\/W4, [-^P{zi,Z2,Wi)\J P{zi,Z3,Wi)\J G{zi,Z2)\ 
A [-^P{zi,Z2,W4) V P{z3,Z2.,Wi) V G{zi,Z2)]. 

5 

Let $ = /\ 

j=i 

Observe that each (1 < i < 5) is a Hi Horn formula, as required by Theorem |31 The 
optimal solution value to the given instance (network G, represented by a structure A), is 
given by 

ojitQ(A) = max|{ti; : (A,P) ^ P{w,t,w) A . (17) 

Discussion. In most such expressions as above, there are two types of conditions to be 
expressed: (1) Global conditions (those that apply over all w tuples), such as the expression 
$ above and (2) Local conditions (the ones that are specific to a given w), such as P{w,t,w) 
above. The first (second) set of conditions correspond to constraints (objective function) in 
a classical mathematical programming framework. 

It is clear that MAXFLOWp^ can be expressed with neither Eq nor Si formulae. In partic- 
ular, without universal quantifiers, none of the five properties — expressions (|12|) to (|16j) — 
can be expressed independently of the size of the instance. 

Consider property 4>2, for instance. Using only existential quantifiers, one should enumerate 
the property individually for each edge. However, this will make the length of (p2 dependent 
on the number of edges in the graph. Hence we can conclude that 

Proposition 4. The property, that an edge belongs at most one edge- disjoint s — t path in 
a solution, (and hence the MAXFLOWps problem) can be expressed with a Hi formula, but 
not with a Til formula. 
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(14) 
(15) 



2.1.2 MAXFLOWpB is Complete for Polynomially Bound Maximisation 

We can show that the MAXFLOWp^ problem is complete for the class of polynomially bound 
maximisation problems by reducing an instance I of a general problem Q' in this class to an 
instance Z of MAXFLOWpp. If I is represented by a structure A, the optimal solution value 
to / is given by Theorem |31 

optQ/(A) = max|{w : (A,S) ^ $}|, $ = Vx 7/(w, x, S) (18) 

if Q' € Q', where x, A, S and rj are defined in Tabled (Recall that r/ is a conjunction of 
Horn clauses and quantifier- free, and S is a sequence of second order predicate symbols.) 

Let the arity of w (x) be k (m) respectively. The diff'erent possible w (x) tuples are Wi, 
1 < i < (xj, 1 < j < n™), where n is the cardinality of the universe A of A. For a given 
Wi, the expression for <I> in (|18j) can be rewritten as 

^'(wi) = Vxr?(wi,x,S) = /\ r?(wi,Xj,S). (19) 

Instance X consists of n'^ + 2 vertices — one for each w; tuple, as well as two additional 
vertices s and t. Add a directed edge with unit capacity from s to each w; vertex. Add a 
directed edge with unit capacity from each Wi vertex to t iff' <I*(wi) holds. 

The reduction is polynomial time — 0{n^) time to create the vertices, and 0{n^~^^) time to 
add the edges. It is clear that instance I of MAXFLOWp^ has a maximum flow of a units 
from source s to sink t iff the optimal solution value to / in (|18|) is also a. 



2.1.3 A Problem in MAXpSo ? 

Within the class Q' (see Table^, let us define the class MAXpSg (MAXpIIi) as the class of 
polynomially bound maximisation problems that can be expressed by a Sq (Hi) formula. 

Kolaitis and Thakur showed that MAX3SAT is in MAXjvpSo (defined similar to MAXpSo), 
a sub set of N'. On the other hand, MAX2SAT is not known to be polynomially solvable 
|H97| . though the decision version, as to whether all clauses are satisfiable, is well-known 
to be in P |GJ79j . Results similar to MAX2SAT for both the maximisation and decision 
versions are also known for HORNSAT (where every clause is required to be a Horn clause) 
|KKM94j . 

Towards the goal of obtaining a hierarchy within the polynomially bound P-maximisation 
class, we need to exhibit a problem in MAXpSq. However, we have been unable to find such 
problem so far. 

We conjecture that as long as a successor relationship or a linear ordering on the universe of 
a structure is necessary, a problem cannot be expressed in MAXpEg (since this will require 
a Hi expression). 
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2.1.4 Hierarchy Within Maximisation 

We state the hierarchy within the polynomiahy bound maximisation class without a formal 
proof (since it is clear from the argument below): If a maximisation problem exists in the 
class (or the Ho class), then the Sq class is strictly contained within the IIi class. 

It is clear that the problem considered in Section 1!^. 1.11 MAXFLOWps, cannot be expressed 
with a Sq formula. In particular, without quantifiers, none of the five properties in Section 
12.1.11 — expressions (fT^ to 1)16(1 — can be expressed. 

Prom Section 12.1.21 clearly MAXFLOWpp serves as a complete problem for the MAXpIIi 
class. It would be desirable to obtain a complete problem for the MAXpEo class. An inter- 
esting observation is that the decision version of the weighted MAXFLOW problem (where 
arc capacity can be any non-negative integer) is complete for the class P |(tHE,95( IImm99] . 

Another question to be answered is, is there a class MAXpSi which is between MAXpIIi 
and MAXpSo ? 

2.2 Polynomially Bound P-Minimisation Problems 

For minimisation problems in N', Kolaitis and Thakur |KT94j proved the following (see Table 
n for a definition of N' and Definition [S] regarding S2 formulae) : 

Theorem 5. A minimisation problem Q' € N' if and only if there exists a E2 first order 
formula (/>(w, S) with predicate symbols from the vocabulary a (of (j)) and the sequence S, such 
that for every instance A of Q' , 

opiQ,(A) =mm|{w: (A,S) h(A(w,S)}|. (20) 



In other words, they showed that all polynomially bound NP-minimisation problems fall in 
what can be called the MIN E2 class. In the same paper, they also showed that this class is 
equivalent to the MIN Hi class. 

A similar result can be shown for minimisation problems in Q' (the polynomial-time equivalent 
of N'): 

Theorem 6. Let A be a structure (instance) defined over a. If Q' is a minimisation problem 
in Q', then the value of an optimal solution to an instance A of Q' can be represented by 

optQ:{A) = min | {^n : (A, S, F) ^ Vx r} | (21) 

where t = r](w,x,S) A -F(w), and x, A, S, r] are defined as in Tabled (The symbol F is a 
k—ary relation defined on the universe \A\ of A, since each w is k— dimensional.) 

Proof. The proof that Q' is polynomially bound is the same as in Theorem |31 

We start with Gradel's Theorem (Theorem^. The decision problem can be represented by 
an ESO Horn expression 3Sip where S is a sequence of predicate symbols, and ■0 is a Hi first 
order Horn expression where a successor relation is included in the vocabulary of '0. 
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The analysis for the decision problem Q is similar to the maximisation case, except that one 
looks for at most K tuples that satisfy the feasibility condition 7]{x,W, S)^. 

Thus for the minimisation version of the problem, an optimal value to an instance A can be 
written as 

optQ>{A) = min I {w : (A, S) ^ 0(w, S)} | (22) 
where (p{w, S) = Vx ?7(x, w, S). 

The w tuples can be considered as a k—aiy relation F such that F(w) is true if and only if 
w € Hence (/>(w, S) in H22|) should be modified to Vx 7]{x, w, S) A ^(w). The number of 
tuples \F\ in F should be minimised. 

Again, out of the the four cases (i) (Vxr/) A F{-w), (ii) (Vx?]) A -iF(w), (iii) (-iVx?]) A -F(w), 
and (iv) (-iVxr/) A -'F(w), cases (ii) and (iv) should be disregarded since F is false, and (iii) 
violates the feasibility condition Vxr/. This leaves us with the following modification of H22() : 

optQ,{A) = min | {w : (A, S) ^ (Vxr/) A F{w)} \ (23) 

Note that (Vxry) A ^(w) = Vx(F(w) A 7]). Since rj is Horn, so is {F{w) A rj). 

It may appear that the minimisation in (|23|) will always result in an optimal value |-F| of 
zero, but since the minimum value is taken only over all feasible solutions {S,F), the value 
obtained in (|23j) is correct. Hence the proof. (The minimisation over "only feasible solutions 
(S,-F)" needs further illustration and is provided below.) □ 

Illustration of Minimisation over "only feasible solutions (S,F)". To illustrate this 
point, consider the SHORTEST PATH problem in Section r2.2.11 We attempt to minimise the 
number of edges in a path from the source s to the sink t. If we minimise over any (S, F) 
combination, obviously this minimum number would be zero — however, this would violate 
the feasibility condition cpi that there exists a path from s to t. Hence this "zero" solution is 
obviously infeasible. 

Consider another example, MIN SET COVER. We are given a ground set X and several 
subsets Yi, Y2, Yq of X. Let C = {Yi, Y2, Yq}. The problem is select a few 
(minimum number of) subsets Yi from C, such that the union of the selected subsets is X. 
We can associate a unary tuple Wi to each Yi, such that the number of such w tuples in a 
solution is to be minimised. Let a second order predicate S{wi) determine if a certain subset 
Yi is chosen in a solution S. Obviously if we minimise over all possible S, the minimum 
number of Yi subsets selected will be zero — but then, such a solution is clearly infeasible, 
since the union of the selected subsets (zero of them!) is not equal to the ground set X. 

Discussion. From Theorems I21E1 and I3I6| the following can be observed in the case of 
polynomially bound optimisation problems: 

While second-order expressions are able to distinguish clearly between NP-maximisation and 
P-maximisation problems (n2 for the former and Horn Hi for the latter), the distinction is 
less clear between NP-minimisation and P-minimisation problems (Hi formulae in both cases, 
the only distinction being the Horn clause requirement in the P-minimisation case). 

^ which is the same as looking for at least — if + 1 tuples that do not satisfy J7(x, W, S). 
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2.2.1 Example: Shortest Path 



We now provide an example of a polynomially bound P-minimisation problem, SHORTEST 
PATHpB. Assume that the edges have unit weight and are directed. The number of edges in 
the shortest path is to be minimised. The decision version of this problem is easily represented 
as a Si formula: 



3rEi3a;2 • • • 3xfc G{s, xi) A G{xi,X2) A • • • A G{xf^_i,Xk) A G{xk,t) 



(24) 



where s is the origin and t is the destination. The above formula says that there is a path 
from s to t of length /c + 1, and it is a Horn formula. {G{x,y) is true if there exists an arc 
{x,y) in graph G.) We have not used any second-order variables in (|24|) . hence the decision 
version is FO (first order) expressible. 

Minimisation version. A shortest path (or any path from origin to destination) represents 
a partial order P on the universe (the set of vertices) — P is represented by a second order 
(SO) binary predicate. Another SO binary predicate S chooses which arcs in the network are 
in the required path. Again, we will use ideas similar to those used for REACHABILITY 



Pap94 . The following formulae express the properties of P and S: 



(pi = P{s,t) = rji (there exists a path from s to t). 

(P2 = Vx Vy Vz r]2, V2 = [iP{x,y) A P(y,z)) ^ P(x,z)] (P is 
transitive.) 

(ps = Vx Vy 773, r)3 = -^P{x, x) A [{P{x, y) -^P{y, x)] 

(P is neither reflexive nor symmetric.) 
(/)4 = Vx Vy 7/4, r/4 = S{x,y) — > [G{x,y) A P{x,y)] (If an edge is 

chosen by S, then it has to be in the given graph G and in the 

s — t path P.) 

(p5 = Mx My r}5 with 775 = P{x,y) [S{x,y) V 3z{P{x,z) A S{z,y))], 
(Recursive definition of P — either there is an (x, y) arc, or 
there exists a path from x to z and a {z,y) arc.) 

(j)Q = yxMyMz r]Q, r]Q = [{S{x,y)AS{z,y)) ^ {x = z)] (Predecessor 
is unique, hence there is a unique path P from s to t.) 



It can be shown that each 77i (1 < i < 6) above is equivalent to a Horn clause — clauses with 
at most one positive literal from the set of second order variables^ {P, S} — or a conjunction 
of such clauses, as required by Theorems ^ and The optimal solution value for instance G 
can now be written in Horn Hi form as 



opt{G) 



mm 

P,S 



(p, q) : (G, P, S) h Vx Vy Vz /\ ??, A Sip, q) 



(25) 



Discussion. Though we have not proved it, SHORTEST PATHpp could be one of those 
problems where the decision version can be represented in Si form, but the optimisation 
problem is in Hi form. It would be interesting if this observation (hierarchy in terms of 
quantifier complexity) could be proven or disproven. 

''A clause such as P{x,z) V S{s,t) cannot be a Horn clause, for instance. 
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3 Optimisation Problems in P' 



We next turn our attention to the class P'. This is the set of all optimisation problems, 
not necessarily polynomially bound, but the optimal solution can be computed within time 
polynomial in the size of the input. (These problems need not obey Equation [51) 

Zimand 1998 |Zim98j generalised Theorems |21 and to all NP-Optimisation problems, not 
just those that are polynomially bound. He showed that a 112 first-order formula captures 
the feasibility conditions for any problem in this class, while the optimal solution value can 
be represented by a maximisation (or minimisation) over weighted tuples — the tuples are 
similar to those used in expressions (fTT|l and for polynomially bound problems, but now 
they are also assigned real number weights. The method of attaching weights to tuples has 
also been discussed in Papadimitriou and Yannakakis 1991 |PY91) . 

We will demonstrate (without a formal proof) that Zimand's result can be extended to 
polynomial-time maximisation problems as well. Zimand shows that for any positive in- 
teger value z for an optimal solution, we can compute a set of weights Cj that are powers of 
two, such that z = q. However, for a given optimisation problem Q', the weights on tuples 
are given quantities, such as the arc capacities in a MAXFLOW problem or the arc costs in 
a TRAVELLING SALESPERSON problem — the weights are part of the input. (Zimand 
makes no attempt to relate his computed weights with the input weights.) 

Gradel's Theorem states that any decision problem Q € P can be represented as 

Q = BStp. (26) 

A decision version of a maximisation problem asks if there is a solution S to an instance I 
(represented by a structure A) such that the objective function /(A, 5) > K where K is a 
given constant. 

Motivation to attach weights to tuples. If / is a YES instance to Q, then must be 
able to express the fact that /(A, 5) > K using a finite structure, according to Gradel's 
Theorem. The quantity /(A, 5), though not polynomially bound in the size of I any more, is 
still a finite quantity. For a structure A with universe A, the number of k—aiy tuples possible 
is l^l'^, which is polynomial in the size of the instance. In other words, f{A,S) need not 
be polynomially bound, whereas the maximum number of w tuples should be — this is in 
contrast to the problems in Sect. |2 One way to capture a larger number (/(A, 5)) using a 
smaller one (the number of w tuples) is by attaching weights to the tuples. 

For example, in the MIN CUT problem (dual of MAX FLOW), the tuples (arcs) are binary, 
and the weights of these tuples are the arc capacities. In WEIGHTED MAX3SAT, the tuples 
(clauses) are ternary with a weight attached to each clause. In WEIGHTED MAXSAT, it 
is unknown how many literals are in each clause, hence a unary tuple is commonly used 
|KT94j . In TRAVELLING SALESPERSON (TSP), the weights on the binary tuples are the 
arc costs. It may be undecided ahead of time how the optimal value to a problem in P' can 
be represented, as to which set of tuples and their weights will be used — for example, the set 
of edges used in a solution to the TSP is unknown until a solution is determined. However, 
the number of such sets and their tuples are finite — and the weight of each tuple is a given 
quantity. 
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Naturally, each set of tuples described above can be said to form a relation Ui over the universe 
of A, and the set of all such relations can be represented by U. For Ui G U, its weight w{Ui) 
is defined as 

wm = J2 ^(w), (27) 

wee/, 

where w{'w) is the given weight of tuple w. For example, in a TSP instance with five vertices, 
each Ui will contain five tuples (the five arcs in the solution). However, in SHORTEST PATH, 
the number of tuples in Ui depends on the path (solution) used — hence the cardinality of 
the different C/j's is not the same, since the number of arcs in each solution can vary. 

Furthermore, the universe A should consist of values (such as vertex indices in graphs) for 
the variables, as well as weights for the tuples^. A unary relation C{x) — sometimes known 
as a hidden relation — decides if a given variable is a basic variable, or a weight for one of 
the tuples. The universe A of a structure A will be of the form 

A = {ai,a2r--an,'Wi,W2r--Wm} (28) 

where the Oj's are possible values for the basic variables and the wj^s are possible weights for 
tuples of basic variables. For any variable Xj, the following expression 4>i{xi) should hold: 

n 

Mxi) = [C(x,)] ^ V = "il- (29) 

Introduce a relation R{xo,xi,X2,- " ^k) which holds true iff xq is a weight for the tuple 
w = (xi, X2, • • • Xk) — hence C(xo) is false, and all other C(xj)'s are true. In an instance, the 
variable xq is instantiated with a weight Wi from A. 

3.1 Maximisation Problems 

Reverting to Gradel's expressibility in H26() . since is in Hi ESO Horn form, it can be written 
as (just like the case for polynomially bound problems), 

ilj{x,Wi,S) = VxiVa;2 • • • Vx^ r}(xi,X2, • • -Xm, Wi,5) = Vx ?7(x, Wi,5) (30) 

where x = (xi,X2, • ■ ■ Xm) — hence 57 should express the fact that w{Ui) > K, and f) should 
include expressions for every </>i(xi) in (|29() . If a certain relation Ui that satisfies the feasibility 
conditions exists, then 

Q = 3S 3Ui Vw Vx [Ui (w) < — ^ f/(x, w, 5)] , (31) 

where Q is the decision problem. From this, the value of the optimal solution (for the 
optimisation problem Q') can be expressed as 

optQ>{A)=max{w{Ui) : {A,S,Ui) ^ VwVx [[/^(w) < — . f/(x, w, 5)]} . (32) 

(A, 5, Ui) above also satisifies expressions where Ui{w) and fi(x,w,S) are false — however, 
since Ui{w) is false, the weight of this tuple w will not be counted in w{Ui). 

Note that (|32() need not be a Hi Horn formula any more, since the Horn property of 
-i?7(x, w,5) is unknown: 

[U,{w) ^ ??(x,w,5)] = [Ui{w) V -7?(x,w,5)] A hC/i(w) Vf)(x,w,5)] . (33) 

^This is a variant of Many-sorted Logic. 
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3.2 Example: Weighted Matching 



Here, we provide an example of how WEIGHTED MATCHING (optimisation version) can 
be expressed. Given a graph G with weights on the edges, the objective is to mark certain 
edges such that the sum of the weights on the marked edges is maximised, with the condition 
that no two adjacent edges in G can be marked. (In the context of this problem, a Matched 
edge is a synonym for a Marked edge.) An instance (structure) A consists of 

(a) the universe A (the union of the set of vertices and the set of tuple- weights) , 

(b) a relation G (the set of edges) , 

(c) a relation C{x), which defines whether a variable is a vertex or the weight of a tuple, 

(d) and a ternary relation R{xo, xi, X2) that decides whether an edge {xi,X2) is assigned a 
weight of xo • 

Let relation U{vi,Vj) be true if {vi,Vj) is a matched edge. Obviously it can be a matched 
edge only if the edge exists in the given graph. This is expressed by 00 below. If edge {vi,Vj) 
is matched and x is a vertex not in {vi, Vj}, then an adjacent edge G{x, Vi) (if it exists in the 
given graph) cannot be matched. This is expressed by The three other expressions 4'2, 
(ps and (/>4 perform the same task. 

(pi = U{vi,Vj) G{vi,Vj), 

T = {x ^ Vi) A {x ^ Vj) A U{vi,Vj), 

(pl = T A G{X, Vi) -.[/ {x, Vi), <j)2 = T A G{vi, x) -.[/ {vi,x), 
(pS = T A G{x, Vj) — > -if/ {x, Vj), (p4 = T A G{Vj,x) -if/ {Vj,x). 

Let set of weights B = {z € A \ 3x3y U {x, y) A R{z, x, y)} — however, since B is a set, if the 
same weight is assigned to two or more edges in U, only one of them will be counted towards 
total edge weights. Thus there is a need to split B into Bi (1 < i < m, m = number of edges 
in the input) — a weight vo in Bi occurs among i edges in U. Hence^ 

Bi = {z e A \ 3x 3y\fu\fv T AU{x,y) AR{z,x,y)} , (34) 
where r = {[{u ^ x) y {v ^ y)] A U{u,v)} ^ ^R{z,u,v). (35) 

(Note: In the definition of B^ below, 3f^iXi is a shorthand for 3xi 3x2 ■ • • ^a;^.) 
In general, any Bj. {1 < k < m) can be expressed as 




k k ^ 

G A I 3^^^Xi 3*Liyi Vn Vi; f\ U{xi,yi) f\ R{z,Xi,yi) A r i 

1=1 1=1 J 



(36) 



n = l/\[iu,v) / {xi,yi)]AU{u,v)\ ^ ^R{z,u,v), (37) 



.i=l 



r2 = /\{xi,yi) / {xj,yj) (38) 



^Issues such as quantifier complexity and Horn property are irrelevant for the logic expressions in 134t - ll39t . 
Logic expressions are used here for the sole purpose of defining Bk, 1 < k < m. 
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{ixi,yi) ^ {xj,yj)} = {{xi / Xj) V (y^ / yj)} . (39) 

Expression (|55|) says that there are k distinct edges {xi,yi). Expression is an explanation 
of the shorthand notation used in (|H7j) and (jHH|) — that if two edges are different, then at 
least one of their endpoints should be different. 

The weight of relation U, 'w{U), is computed as: 



w 



Finally, $ is the expression that a solution U should satisfy, and the optimal solution value 
is obtained by maximising over all such solutions: 

4 

$ = Vx [C{vi) A Civj) A C{x)] ^ /\ (/.fc, (41) 

fc=0 

optQ,{A) = max {w{U) : {A,U) ^ . (42) 



4 Future Research 



The open question — mentioned in the proof to Theorem |31 — of how to express decision 
versions of optimisation problems in the Hi form specified by Gradel for problems in P in Sect. 
12.11 needs resolution. A formal proof is needed for the arguments in Sect. 13.11 Furthermore, 
Sect. ISlstudies only maximisation problems — research should be carried out for minimisation 
problems as well. Complete problems should be discovered for the respective subclasses. 

Since the decision version of the weighted MAXFLOW problem (where arc capacity can be 
any non- negative integer) is complete for the class P |GHR95l IImm99j . the optimisation 
version of weighted MAXFLOW is likely to be a complete problem for P' — this is yet to be 
proven. 
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